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We present a measurement of the ratio of the ti production cross section via gluon-gluon fusion 
to the total ti production cross section in pp collisions at ^/s = 1.96 TeV at the Tevatron. Using 
a data sample with an integrated luminosity of 955 pb^^ recorded by the CDF II detector at Fer- 
milab, we select events based on the ti decay to lepton+jets. Using an artificial neural network 
technique we discriminate between ti events produced via qq annihilation and gg fusion, and find 
Gf =a{gg — » ti)/a{pp —> ti) < 0.33 at the 68% confidence level. This result is combined with a 
previous measurement to obtain the most stringent measurement of this quantity by CDF to date, 
Gf = Omtlfr- 
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In hadron colliders, such as the Tevatron, the pair pro- 
duction of heavy quarks has contributions from the dif- 
ferent partons present in the initial state hadrons. While 
for a given quark flavor the total pair production cross 
section can be measured simply by counting events in 
specific final-state channels, the contribution from the 
different primary partons to this cross section is difficult 
to estimate. For beauty, charm, and light-flavor produc- 
tion, the hadronization process does not normally allow 
the spin and kinematic properties of a quark to be ob- 
served through the analysis of the final state particles. 
The situation is different for ti production. The top 
quark, with a mass of about 175 GeV/c^, has a lifetime 
that is an order of magnitude shorter than the typical 
hadronization time of«5xl0~^''s[l|. As a consequence, 
the spin and kinematic information of the top quark are 
preserved in its decay products, allowing the different ti 
production processes to be distinguished based on the 
kinematic characteristics of the final state particles. 

The standard model (SM) predicts the tt production 
processes to be qq annihilation (qq -^ tt) and gg fusion 
{gg — > tt), occurring at the Tevatron with relative frac- 
tions of ~85% and ^15%, respectively, and having sig- 
nificantly different kinematic properties |3|- Predictions 
for the relative fraction of tt production from gg fusion 
range from 10% to 20% due to uncertainties in the parton 
density functions [3|, Ij] . A measurement of this fraction 
tests the SM prediction and our understanding of gluon 
parton distribution functions (PDF) in the proton. Dis- 
agreement with this prediction could reveal the possible 
existence of new mechanisms of top quark production and 
decay. For instance, production of top pairs at the Teva- 
tron could be affected by a new vector particle associated 
with topcolor [5|, |6| . Such a resonance would affect the 
angular correlations between the top and antitop, and 
the relative mixture of qq and gg initiated tt production. 
Additionally, new physics in the decay of the top quark. 
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such as a t ^ iJ+ 6, would also affect these correlations 

i. 

This Letter details the first measurement of the frac- 
tion Gf ~ <y{gg -^ tt)/<y(j>P — * ti) based on the kine- 
matics of ti production, its decay products, and their 
correlations. We use the ti event kinematics in an artifi- 
cial neural network (NN) to distinguish between the two 
modes of production. This analysis is described in detail 
in [7|. We use data in pp collisions at ^/s = 1.96 TcV 
collected by the multi-purpose Collider Detector at Fer- 
milab (CDF II) [1] from February 2002 to March 2006, 
corresponding to an integrated luminosity of 955 pb~^. 
The result of this analysis is then combined with a com- 
plementary measurement of this fraction [9| , which takes 
advantage of the higher probability for a primary gluon, 
compared to a quark, to radiate a low energy gluon in 
the production process. 

The production of ti is expected to be followed by the 
decay of each top quark to a H^ boson and a b quark 
with a branching ratio of approximately 100% [l|. We 
select events according to the topology of the ti decay to 
lepton-|-jets, in which one W decays leptonically and the 
other one hadronically, ti — > W^h W~h — > Ivi b q' qh. 
We require events to have an electron or muon candi- 
date with pt > 20GeV/c and \ri\ < 1, an imbalance 
of transverse energy of ^t > 20 GeV [10| as expected 
from the undetected neutrino, and four or more jets with 
Pt > 20GeV/c and \r]\ < 2 [l3|. A jet is defined as a 
cluster of energy in the calorimeter and is reconstructed 
using an algorithm with a fixed cone of radius 0.4 in r]-(f> 
space [ll|. To account for non-linearities in the detector 
response and multiple pp collisions in an event, we cor- 
rect jet energies and ^t UM- Furthermore, to increase 
the purity of the sample, at least one jet in the event 
is required to have a displaced vertex (6-tag), which is 



indicative of the likely 6-quark origin of the jet [13|. We 
find 167 candidate events with one 6-tag, and 65 candi- 
date events with two or more 6-tags that pass our event 
selection. 

Background processes in the ti candidate sample orig- 
inate primarily from direct Il^-l-jets production, with mi- 
nor contributions from diboson production {WW, WZ, 
ZZ), and multi-jet production (non-M^). The expected 
background estimates in the one b-tag and two or more b- 
ta gs ca tegories are shown in Table U and were determined 
in [iJI with 318 pb~^ of data and scaled to 955 pb""'^ of 
data. We also show the number of events observed in the 
data and the signal fraction (Sj). 

Using the herwig version 6.5 [l^ leading-order (LO) 
Monte Carlo (MC) generator, we simulate ti signal sam- 
ples for the two production processes with a top quark 
mass of 175GeV/c^. We model the dominant back- 
ground of ly-l-light and M^-l-heavy flavor jets events with 
the ALPGEN MC generator [ll|, using HERWIG to model 
parton showers. All generated events are passed through 
the CDF II detector simulation |8|. 



TABLE I: Number of expected background events in the one 
6-tag and two or more &-tags categories for an integrated lu- 
minosity of 955 pb~^. The number of events observed in the 
data and the signal fraction (S/) are also listed. 

1 6-tag > 2 6-tags 
Total background 27.3 ± 3.4 2.6 ±0.7 

data 167 65 

signal fraction Sf 0.84 ± 0.11 0.96 ± 0.17 



We consider only the four jets with highest transverse 
energy in each event and include all possible permuta- 
tions associating jets with partons consistent with the b- 
tag information. The reconstruction is performed using 
a kinematic fitter [17| which compares the jet-to-parton 
association to the tt hypothesis assuming the masses of 
the W bosons and the top quarks to be 81 GeV/c^ and 
175 GcV/c^, respectively. In the fitter the energy scale of 
the jets is varied according to its uncertainty. The agree- 
ment between each permutation and the tt hypothesis 
is quantified by the x^ value of the fit. For each event 
the permutation with the lowest x^ is used to extract 
kinematic variables as described below. 

We calculate eight variables that are sensitive to the 
production mechanism; two of these describe the pro- 
duction and the other six describe the decay of a given tt 
event. At leading order the tt production rate depends on 
two variables evaluated in the ti rest frame; the cosine of 
the angle between the top quark momentum and the di- 
rection of the incoming pro ton, cos 9* , and the top quark 
velocity relative to c, (3 jl8| . Since the functional form of 
the tt production rate to these variables is different for 
gg fusion than it is for qq annihilation, these variables 
contain information that could allow us to distinguish 
the production mechanism. The remaining six variables 
describe the tt decay and contain information about the 
correlations between the spins of the top quarks. These 
variables are the cosines of the angles with respect to the 
"off-diagonal" spin basis 18|, ll9| in the parent top-quark 



rest frame. One characteristic feature of this basis is that 
the number of qq — > ti events that have parallel top-quark 
spins as evaluated in this basis vanishes. Top pair events 
with parallel top-quark spins come exclusively from gg 
production. The decay variables are the cosines of the 
angles between the direction of the "off-diagonal" basis 
and the lepton, neutrino, leptonically decaying W, down- 
type quark, up-type quark, and hadronically decaying W. 
The distribution of data events for these variables shows 
very good agreement with the distributions from simu- 
lated background and ii events with Gf = 0.15. 

To obtain a single discriminating quantity the eight 
kinematic variables are fed into a NN. The NN used in 
this analysis [20| has an architecture of eight inputs, two 
hidden layers individually with ten and five nodes, and a 
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FIG. 1: Distributions of cos6* for events with one &-tag in 
data, expected if and background. The distribution of ii plus 
background uses the ratio of gg to total ti obtained from the 



fit of G 



fit 
f 



-0.075. We also show the expected distributions 



for gg-only and gg-only hypotheses where background is in- 
cluded. The error bars on the total ti + background includes 
the statistical uncertainty from Poisson statistics. 



single output. We train the NN to distinguish between qq 
and gg simulated ii events, with separate training for the 
one 6-tag events and for two or more 6-tags events. Re- 
ducing or increasing the numbers of layers and nodes does 
not significantly change the discriminating power. Ap- 
proximately one-third of the discriminating power comes 
from the production variable /3, one-third from cos 9* , 
and one-third from the remaining six decay variables. 
Figure [1] shows the distribution of cos 9*, one of our more 
sensitive variables, for events with one 6-tag in data, ex- 
pected ti, and background. 

In each 6-tag category we obtain three template distri- 
butions of the NN output, r««, T^s, and T'^'^a, by running 
the NN over qq produced ft, gg produced ft, and back- 
ground MC events, respectively. These templates repre- 
sent the probability for an event to have the NN output 
obtained assuming it was a ti produced by gg fusion, a 
ft produced by qq annihilation, or a background event. 

An estimator of the gg fraction in the sample, G"?, is 
obtained by maximizing a likelihood function. We calcu- 
late the likelihood of the full event sample for a given G? 
as 



£(g^) = /:Hg?)/:'(g^), 



(1) 



where C^ and C'^ are the one 6-tag and two or more 6- 
tags likelihoods, respectively. The individual likelihoods 



e defined as 




CiG'j) = exp 


'-{S}-SfY 
L S/ J 



11{{1-S})T^''^ 



S}[G^fTf^ + {l-Gf)n^]}, 



(2) 



where i ~ {1,2}, the product is over the events in the 
«th 6-tag category, the values of the signal fraction S/ 
and its uncertainty ct^ ^ are taken from Table HI and the 

variables 5*} represent the observed signal fractions in 
the sample. The overall multiplicative term represents 
a Gaussian weight centered at the expected signal frac- 
tion. Scanning over values of G"? we find the maximum 

'2 
/• 



likelihood solution by varying the fractions Si and 5* 
In a given sample the G? value for which the likelihood 



is maximum is called G V . 



The fitted fraction G^'* 



IS 

related to the true production fraction G/ by the accep- 
tance ratio of gg -^ tt to qq — > tt. Using HERWIG MC 
[l5| calculations we estimate the acceptance ratio to be 
1.29 ± 0.02 and 1.25 ± 0.02 for the one 6-tag and two 
or more 6-tags categories respectively. The value of G^ 
is not constrained to the physically allowable range be- 
tween zero and unity, and neither would be an estimate 
of Gf obtained from taking into account the acceptance 
ratio. To ensure a result for G/ in the physical range we 
use the Feldman- Cousins prescription (FC) [2l|, which 
maps any result of Gf to a range of the true fraction 
Gf. We generate this map by fitting for Gf in sim- 
ulated experiments with a known G/ and Sf . These 
simulated experiments are a mixture of gg, qq, and back- 
ground events keeping the total number in each experi- 
ment fixed to that observed in data for that 6-tag cat- 
egory. We fit the distribution of G l^ for the simulated 
experiments to a Gaussian shape for each value of Gf. 
The FC likelihood-ratio ordering principle 2l| is applied 
to the Gaussian obtained from the simulated experiments 
to construct the confidence level (C.L.) bands. 

To incorporate systematic effects into the FC prescrip- 
tion we generate auxiliary sets of simulated experiments 
chosen from signal and/or background samples designed 
to study various sources of systematic uncertainty. The 
difference of the mean of the G{* distributions gener- 
ated with the standard and the auxiliary simulated ex- 
periments is added in quadrature to the original width 
of the Gaussian distribution obtained with the standard 
sample. We repeat the procedure for each value of G/ 
and for significant sources of systematic uncertainties. 
The dominant systematic uncertainties result from un- 
certainties in the background shape and composition, and 
differences between LO and next-to-leading order (NLO) 
predictions estimated by comparing to a ti MC sample 
generated with the NLO generator mc@NLO [221. We 
evaluate the systematic uncertainty on this measurement 
due to parton distribution functions by using MC sam- 
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FIG. 2: Feldman- Cousins bands for the combination of the 
analyses with statistical and systematic uncertainties for 68% 
C.L. and 95% C.L.. In the data, we find G;^" = 0.073, which 
yields Gf = 0.07lj;j? and Gf < 0.38 at the 95% C.L. 



pies generated with MRST PDF's [231 and the full set of 
eigenvectors known as CTEQ6M from the CTEQ collab- 
oration [2J] . We also include sources of systematic effects 
arising from the jet energy scale [l2l | and initial and final 



state radiation [25 1. 

Finally, we evaluate the log likelihood for the data sam- 
ple and find the minimum of the negative log likelihood to 



bcG 



fit 
/ 



-0.075. The variables Sl and S"? for this value 



of G f^ match those given in Table [J within the uncer- 
tainties. The x^ goodness of fit test between the observed 
data values and the expected values at Gj results in 
X^/ndf = 0.9, indicating a good agreement between the 
observed and fitted distributions. For this value of G i^ 
the FC construction results in G/ < 0.33 at the 68% C.L. 
and Gf < 0.61 at the 95% C.L., respectively. 

This measurement is combined with the one performed 
in [9J also using 955 pb~^ of CDF data, in which the 
G i^ fraction is estimated from a fit to the distribution 
of the number of low transverse momentum charged par- 
ticles in the event by comparing the data distribution 
to those from gluon-originated and quark-originated pro- 
cesses. This analysis alone results in G {^ = 0.09 ± 0.16, 
where the uncertainty includes the statistical and sys- 
tematic uncertainties. 

We perform the combination using the FC prescrip- 
tion by including the track multiplicity information in 
the simulated experiments used for the evaluation of the 
FC bands. The statistical correlation between the event 
kinematics analysis and the track multiplicity analysis is 
found to be negligible. For each gg (qq) produced tt event 



in a simulated experiment, the value of the track multi- 
plicity for that event is chosen randomly from the gluon- 
(quark-)originated track distribution. Primary giuons 
are estimated to contribute to background processes in 
54±9% of the cases [9[. Therefore, for background events 
in a given simulated experiment the value of the track 
multiplicity is obtained from the giuon-originated distri- 
bution 54% of the time and from the quark distribution 
the remaining times. For each simulated experiment we 
evaluate the likelihood as a function of G? for the track 
multiplicity analysis using the goodness-of-fit to data for 
that fraction. We construct the combined likelihood by 
multiplying the likelihood of Eqn.[T]by the corresponding 
likelihood for the track multiplicity analysis. The distri- 
bution of Gf values of the combined likelihood is then 
used to construct the combined FC bands shown in Fig. [2] 
at 68% and 95% C.L. The value that maximizes the com- 



bined likelihood for the observed events is G 



fit 
/ 



0.073, 



indicated by the vertical arrow in Fig. [2l For this value 

0.07;^n7, and we find the 95% 



of Gf we measure Gf 



C.L. limit to be Gf < 0.38 US- 

To conclude, we report on the first limit of the fraction 
of gg produced ti events relative to the total by differenti- 
ating between the kinematic properties and their correla- 
tions of both production processes. Using this technique 
we limit the fraction Gf < 0.33 at 68% C.L. and find 
it to be consistent with SM expectations. The combi- 
nation with the measurement described in [9| results in 
Gf = 0.071q'|!)7, yielding the most stringent measurement 
by CDF of this quantity to date. 
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